Green Tissue-Preferred Promoter from Maize

ABSTRACT

The present invention provides compositions and methods for regulating expression of heterologous nucleotide sequences in a plant. Compositions include a novel nucleotide sequence for a green tissue-preferred promoter from maize. A method for expressing a heterologous nucleotide sequence in a plant using the promoter sequences disclosed herein is provided. The method comprises stably incorporating into the genome of a plant cell a nucleotide sequence operably linked to the green tissue-preferred promoter of the present invention and regenerating a stably transformed plant that expresses the nucleotide sequence.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Application No. 61/306,042, filed Feb. 19, 2010, which is hereby incorporated herein in its entirety by reference.

FIELD OF THE INVENTION

The present invention relates to the field of plant molecular biology, more particularly to regulation of gene expression in plants.

BACKGROUND OF THE INVENTION

Recent advances in plant genetic engineering have enabled the engineering of plants having improved characteristics or traits, such as disease resistance, insect resistance, herbicide resistance, enhanced stability or shelf-life of the ultimate consumer product obtained from the plants and improvement of the nutritional quality of the edible portions of the plant. Thus, one or more desired genes from a source different than the plant, but engineered to impart different or improved characteristics or qualities, can be incorporated into the plant's genome. New gene(s) can then be expressed in the plant cell to exhibit the desired phenotype such as a new trait or characteristic.

The proper regulatory signals must be present and be in the proper location with respect to the gene in order to obtain expression of the newly inserted gene in the plant cell. These regulatory signals may include a promoter region, a 5′ non-translated leader sequence and a 3′ transcription termination/polyadenylation sequence.

A promoter is a DNA sequence that directs cellular machinery of a plant to produce RNA from the contiguous coding sequence downstream (3′) of the promoter. The promoter region influences the rate, developmental stage, and cell type in which the RNA transcript of the gene is made. The RNA transcript is processed to produce messenger RNA (mRNA) which serves as a template for translation of the RNA sequence into the amino acid sequence of the encoded polypeptide. The 5′ non-translated leader sequence is a region of the mRNA upstream of the protein coding region that may play a role in initiation and translation of the mRNA. The 3′ transcription termination/polyadenylation signal is a non-translated region downstream of the protein coding region that functions in the plant cells to cause termination of the RNA transcript and the addition of polyadenylate nucleotides to the 3′ end of the RNA.

Expression of heterologous DNA sequences in a plant host is dependent upon the presence of an operably linked promoter that is functional within the plant host. The type of promoter sequence chosen is based on when and where within the organism expression of the heterologous DNA is desired. Where expression in specific tissues or organs is desired, tissue-preferred promoters may be used. Where gene expression in response to a stimulus is desired, inducible promoters are the regulatory element of choice. In contrast, where continuous expression is desired throughout the cells of a plant, constitutive promoters are utilized.

An inducible promoter is a promoter that is capable of directly or indirectly activating transcription of one or more DNA sequences or genes in response to an inducer. In the absence of an inducer, the DNA sequences or genes will not be transcribed or will be transcribed at a level lower than in an induced state. The inducer can be a chemical agent, such as a metabolite, growth regulator, herbicide or phenolic compound, or a physiological stress directly imposed upon the plant such as cold, heat, salt, drought, or toxins. In the case of fighting plant pests, it is also desirable to have a promoter which is induced by plant pathogens, including plant insect pests, nematodes or disease agents such as a bacterium, virus or fungus. Contact with the pathogen will induce activation of transcription, such that a pathogen-fighting protein will be produced at a time when it will be effective in defending the plant. A pathogen-induced promoter may also be used to detect contact with a pathogen, for example by expression of a detectable marker, so that the need for application of pesticides can be assessed. A plant cell containing an inducible promoter may be exposed to an inducer by externally applying the inducer to the cell or plant such as by spraying, watering, heating, or by exposure to the operative pathogen.

A constitutive promoter is a promoter that directs expression of a gene throughout the various parts of a plant and continuously throughout plant development. Examples of some constitutive promoters that are widely used for inducing the expression of heterologous genes in transgenic plants include the nopaline synthase (NOS) gene promoter from Agrobacterium tumefaciens (U.S. Pat. No. 5,034,322), the cauliflower mosaic virus (CaMV) 35S and 19S promoters (U.S. Pat. No. 5,352,605), those derived from any of the several actin genes, which are known to be expressed in most cell types (U.S. Pat. No. 6,002,068), and the ubiquitin promoter, ubiquitin being a gene product known to accumulate in many cell types.

Additional regulatory sequences upstream and/or downstream from the core promoter sequence may be included in expression constructs of transformation vectors to bring about varying levels of expression of heterologous nucleotide sequences in a transgenic plant. Genetically altering plants through the use of genetic engineering techniques to produce plants with useful traits thus requires the availability of a variety of promoters.

In order to maximize the commercial application of transgenic plant technology, it is important to direct the expression of the introduced DNA in a site-specific manner. For example, it is desirable to produce toxic defensive compounds in tissues subject to pathogen attack, but not in tissues that are to be harvested and eaten by consumers. By site-directing the synthesis or storage of desirable proteins or compounds, plants can be manipulated as factories, or production systems, for a tremendous variety of compounds with commercial utility. Cell-specific promoters provide the ability to direct the synthesis of compounds, spatially and temporally, to highly specialized tissues or organs, such as roots, leaves, vascular tissues, embryos, seeds, or flowers.

Alternatively, it might be desirable to inhibit expression of a native DNA sequence within a plant's tissues to achieve a desired phenotype. Such inhibition might be accomplished with transformation of the plant to comprise a tissue-preferred promoter operably linked to an antisense nucleotide sequence, such that expression of the antisense sequence produces an RNA transcript that interferes with translation of the mRNA of the native DNA sequence.

Constitutive expression of some heterologous proteins, such as insecticides, leads to undesirable phenotypic and agronomic effects. Limiting expression of insecticidal proteins, for example, to the target tissues of insect feeding, allows the plant to devote more energy to normal growth rather than toward expression of the protein throughout the plant. Using tissue-preferred promoters, one can also limit expression of the protein in non-desirable portions of the plant. However, many of the tissue-preferred promoters that have been isolated do not direct the expression of sufficient amounts of transgene for efficacy in plants. Thus, the isolation and characterization of tissue-preferred promoters that can direct transcription of a sufficiently high level of a desired heterologous nucleotide sequence is needed.

Since the patterns of expression of a chimeric gene (or genes) introduced into a plant are controlled using promoters, there is an ongoing interest in the isolation and identification of novel promoters which are capable of controlling expression of a chimeric gene (or genes).

SUMMARY OF THE INVENTION

Compositions and methods for regulating gene expression in a plant are provided. Compositions comprise novel nucleotide sequences for a promoter that initiates transcription in a green tissue-preferred manner. More particularly, a transcriptional initiation region isolated from maize is provided. Further embodiments of the invention comprise the nucleotide sequence set forth in SEQ ID NO: 1. This plant promoter sequence was deposited with the Agricultural Research Service (ARS) Culture Collection, housed in the Microbial Genomics and Bioprocessing Research Unit of the National Center for Agricultural Utilization Research (NCAUR), under the Budapest Treaty provisions, and assigned patent deposit number NRRL-B-50179. The compositions of the embodiments further comprise nucleotide sequences having at least 95% sequence identity to the sequence set forth in SEQ ID NO: 1, and which drive green tissue-preferred expression of an operably linked nucleotide sequence. Also included are functional fragments of the sequence set forth as SEQ ID NO: 1, which drive green tissue-preferred expression of an operably linked nucleotide sequence.

Compositions of the present invention also include DNA constructs comprising a promoter of the embodiments operably linked to a heterologous nucleotide sequence of interest wherein said promoter is capable of driving expression of said nucleotide sequence in a plant cell and said promoter comprises the nucleotide sequences of the present invention. The embodiments further provide expression vectors, and plants or plant cells having stably incorporated into their genomes a DNA construct mentioned above. Additionally, compositions include transgenic seed of such plants.

Methods of the embodiments comprise a means for selectively expressing a nucleotide sequence in the green tissues of a plant, comprising transforming a plant cell with a DNA construct, and regenerating a transformed plant from said plant cell, said DNA construct comprising a promoter and a heterologous nucleotide sequence operably linked to said promoter, wherein said promoter initiates green tissue-preferred transcription of said nucleotide sequence in a plant cell. In this manner, the promoter sequences are useful for controlling the expression of operably linked coding sequences in a green tissue-preferred manner.

Downstream from and under the transcriptional initiation regulation of the promoter will be a sequence of interest that will provide for modification of the phenotype of the plant. Such modification includes modulating the production of an endogenous product, as to amount, relative distribution, or the like, or production of an exogenous expression product to provide for a novel function or product in the plant. For example, a heterologous nucleotide sequence that encodes a gene product that confers herbicide, salt, cold, drought, pathogen or insect resistance is encompassed.

In a further aspect, methods of the embodiments relate to a method for modulating expression of a gene in the green tissues of a stably transformed plant comprising the steps of (a) transforming a plant cell with a DNA construct comprising the promoter of the present invention operably linked to at least one nucleotide sequence; (b) growing the plant cell under plant growing conditions; and (c) regenerating a stably transformed plant from the plant cell wherein expression of the nucleotide sequence alters the phenotype of the plant.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the average abundance of the maize PsaH gene MPSS tag in various maize tissues.

DETAILED DESCRIPTION OF THE INVENTION

The compositions of the present invention comprise novel nucleotide sequences for plant promoters, particularly a maize promoter with a green tissue-preferred expression pattern. In particular, the present invention provides for isolated nucleic acid molecules comprising the nucleotide sequence set forth in SEQ ID NO: 1, which was deposited in bacterial hosts as Patent Deposit No. NRRL-B-50179 on Sep. 25, 2008, and fragments, variants, and complements thereof.

A deposit of the maize PsaH promoter was made on Sep. 25, 2008 with the Agricultural Research Service (ARS) Culture Collection, housed in the Microbial Genomics and Bioprocessing Research Unit of the National Center for Agricultural Utilization Research (NCAUR), under the Budapest Treaty provisions. The deposit was given the following accession number: NRRL-B-50179. The address of NCAUR is 1815 N. University Street, Peoria, Ill., 61604. This deposit will be maintained under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. This deposit was made merely as a convenience for those of skill in the art and is not an admission that a deposit is required under 35 U.S.C. §112. The deposit will irrevocably and without restriction or condition be available to the public upon issuance of a patent. However, it should be understood that the availability of a deposit does not constitute a license to practice the subject invention in derogation of patent rights granted by government action.

The promoter sequences of the embodiments are useful for expressing operably linked nucleotide sequences in a tissue-preferred, particularly a green tissue-preferred, manner. The sequences of the embodiments also find use in the construction of expression vectors for subsequent transformation into plants of interest, as probes for the isolation of other similar promoters, as molecular markers, and the like.

The promoter of the embodiments was isolated from maize genomic DNA by PCR amplification. The specific method used to obtain the promoter of the present invention is described in detail in Example 2 in the experimental section of this application.

The embodiments encompass isolated or substantially purified nucleic acid compositions. An “isolated” or “purified” nucleic acid molecule, or biologically active portion thereof, is substantially free of other cellular material or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Generally, an “isolated” nucleic acid is free of sequences (for example, protein encoding sequences) that naturally flank the nucleic acid (i.e., sequences located at the 5′ and 3′ ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequences that naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived.

The PsaH gene encodes a Zea mays photosystem I complex PsaH subunit precursor. The gene is expressed predominantly in maize leaf and whorl tissue, as demonstrated by gene tissue profile comparisons derived from Massively Parallel Signature Sequencing (MPSS), as further discussed in Example 1. The gene is expressed secondarily in husk, whorl, silk, tassel, and stalk tissue.

The promoter sequence of the present invention directs expression of operably linked nucleotide sequences in a green tissue-preferred manner. Therefore, the promoter sequence finds use in the green tissue-preferred expression of an operably linked nucleotide sequence of interest.

The compositions of the embodiments include isolated nucleic acid molecules comprising the promoter nucleotide sequence set forth in SEQ ID NO: 1. The term “promoter” is intended to mean a regulatory region of DNA usually comprising a TATA box capable of directing RNA polymerase II to initiate RNA synthesis at the appropriate transcription initiation site for a particular coding sequence. A promoter may additionally comprise other recognition sequences generally positioned upstream or 5′ to the TATA box, referred to as upstream promoter elements, which influence the transcription initiation rate. It is recognized that having identified the nucleotide sequences for the promoter regions disclosed herein, it is within the state of the art to isolate and identify further regulatory elements in the 5′ untranslated region upstream from the particular promoter regions identified herein. Thus, for example, the promoter regions disclosed herein may further comprise upstream regulatory elements such as those responsible for tissue and temporal expression of the coding sequence, enhancers, and the like. See particularly Australian Patent No. AU-A-77751/94 and U.S. Pat. Nos. 5,466,785 and 5,635,618. In the same manner, the promoter elements that enable expression in the desired tissue can be identified, isolated, and used with other core promoters to confer green tissue-preferred expression. In this aspect of the embodiments, a “core promoter” is intended to mean a promoter without promoter elements.

In the context of this disclosure, the term “regulatory element” also refers to a sequence of DNA, usually, but not always, upstream (5′) to the coding sequence of a structural gene, which includes sequences which control the expression of the coding region by providing the recognition for RNA polymerase and/or other factors required for transcription to start at a particular site. An example of a regulatory element that provides for the recognition for RNA polymerase or other transcriptional factors to ensure initiation at a particular site is a promoter element. A promoter element comprises a core promoter element, responsible for the initiation of transcription, as well as other regulatory elements (as discussed elsewhere in this application) that modify gene expression. It is to be understood that nucleotide sequences, located within introns or 3′ of the coding region sequence, may also contribute to the regulation of expression of a coding region of interest. Examples of suitable introns include, but are not limited to, the maize IVS6 intron, or the maize actin intron. A regulatory element may also include those elements located downstream (3′) to the site of transcription initiation, or within transcribed regions, or both. In the context of the present invention a post-transcriptional regulatory element may include elements that are active following transcription initiation, for example translational and transcriptional enhancers, translational and transcriptional repressors, and mRNA stability determinants.

The regulatory elements, or fragments thereof, of the present invention may be operatively associated with heterologous regulatory elements or promoters in order to modulate the activity of the heterologous regulatory element. Such modulation includes (1) enhancing or repressing transcriptional activity of the heterologous regulatory element; (2) modulating post-transcriptional events; or (3) both enhancing or repressing transcriptional activity of the heterologous regulatory element and modulating post-transcriptional events. For example, one or more regulatory elements, or fragments thereof, of the present invention may be operatively associated with constitutive, inducible, or tissue specific promoters or fragments thereof, to modulate the activity of such promoters within desired tissues within plant cells.

The maize green tissue-preferred promoter sequence of the present invention, when assembled within a DNA construct such that the promoter is operably linked to a nucleotide sequence of interest, enables expression of the nucleotide sequence in the cells of a plant stably transformed with this DNA construct. The term “operably linked” is intended to mean that the transcription or translation of the heterologous nucleotide sequence is under the influence of the promoter sequence. “Operably linked” is also intended to mean the joining of two nucleotide sequences such that the coding sequence of each DNA fragment remains in the proper reading frame. In this manner, the nucleotide sequences for the promoters of the embodiments are provided in DNA constructs along with the nucleotide sequence of interest, typically a heterologous nucleotide sequence, for expression in the plant of interest. The term “heterologous nucleotide sequence” is intended to mean a sequence that is not naturally operably linked with the promoter sequence. While this nucleotide sequence is heterologous to the promoter sequence, it may be homologous, or native; or heterologous, or foreign, to the plant host.

The regulatory sequences of the present invention, when operably linked to a heterologous nucleotide sequence of interest and stably incorporated into the plant genome, drive “green tissue-preferred” expression of the heterologous nucleotide sequence. The term “green tissue-preferred” is intended to mean that expression of the heterologous nucleotide sequence is most abundant in the green colored tissues of a plant, such as, but not limited to, leaves. While some level of expression of the heterologous nucleotide sequence may occur in other plant tissue types, expression occurs most abundantly in the green tissues of the plant.

It is recognized that the promoters of the embodiments may be used with their native coding sequences to increase or decrease expression, thereby resulting in a change in phenotype of the transformed plant.

Modifications of the isolated promoter sequences of the present invention can provide for a range of expression of the heterologous nucleotide sequence. Thus, they may be modified to be weak promoters or strong promoters. Generally, a “weak promoter” is intended to mean a promoter that drives expression of a coding sequence at a low level. A “low level” of expression is intended to mean expression at levels of about 1/10,000 transcripts to about 1/100,000 transcripts to about 1/500,000 transcripts. Conversely, a strong promoter drives expression of a coding sequence at a high level, or at about 1/10 transcripts to about 1/100 transcripts to about 1/1,000 transcripts.
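The approximate transcript-abundance ranges given above can be read as a simple classification rule. The following Python sketch is illustrative only: the function name and the exact cut-off values are assumptions drawn from the "about" ranges in the preceding paragraph, and the "intermediate" label for values falling between the two ranges is not a term used in this disclosure.

```python
from fractions import Fraction

# Hypothetical cut-offs taken from the approximate ranges quoted above.
STRONG_MIN = Fraction(1, 1000)   # strong: about 1/10 to about 1/1,000 transcripts
WEAK_MAX = Fraction(1, 10000)    # weak: about 1/10,000 to about 1/500,000 transcripts


def classify_promoter_strength(transcript_fraction):
    """Label a promoter by the fraction of all transcripts it accounts for."""
    if not 0 < transcript_fraction <= 1:
        raise ValueError("transcript_fraction must be in the interval (0, 1]")
    if transcript_fraction >= STRONG_MIN:
        return "strong"
    if transcript_fraction <= WEAK_MAX:
        return "weak"
    # Values between the two quoted ranges are not named in the text.
    return "intermediate"


if __name__ == "__main__":
    for value in (Fraction(1, 100), Fraction(1, 5000), Fraction(1, 100000)):
        print(value, classify_promoter_strength(value))
```

Because the ranges in the text are approximate, the cut-offs here should be treated as adjustable parameters rather than fixed definitions.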

Fragments and variants of the disclosed promoter sequences are also encompassed by the present invention. A “fragment” is intended to mean a portion of the promoter sequence. Fragments of a promoter sequence may retain biological activity and hence encompass fragments capable of driving green tissue-preferred expression of an operably linked nucleotide sequence. Thus, for example, less than the entire promoter sequence disclosed herein may be utilized to drive expression of an operably linked nucleotide sequence of interest, such as a nucleotide sequence encoding a heterologous protein. It is within the skill in the art to determine whether such fragments decrease expression levels or alter the nature of expression, i.e., constitutive or inducible expression. Alternatively, fragments of a promoter nucleotide sequence that are useful as hybridization probes, such as described below, generally do not retain this regulatory activity. Thus, fragments of a nucleotide sequence may range from at least about 20 nucleotides, about 50 nucleotides, about 100 nucleotides, and up to the full-length nucleotide sequence of the embodiments.

Thus, a fragment of the promoter nucleotide sequence of SEQ ID NO: 1 may encode a biologically active portion of the promoter or it may be a fragment that can be used as a hybridization probe or PCR primer using methods disclosed below. A biologically active portion of a promoter can be prepared by isolating a portion of the promoter nucleotide sequence of the embodiments and assessing the activity of that portion of the promoter. Nucleic acid molecules that are fragments of a promoter nucleotide sequence comprise at least 15, 20, 25, 30, 35, 40, 45, 50, 75, 100, 325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 1100, 1200, 1300, 1400 or up to the number of nucleotides present in the full-length promoter nucleotide sequence disclosed herein, e.g., 1442 nucleotides for SEQ ID NO: 1.

The nucleotides of such fragments will usually comprise the TATA recognition sequence of the particular promoter sequence. Such fragments may be obtained by use of restriction enzymes to cleave the naturally occurring promoter nucleotide sequence disclosed herein; by synthesizing a nucleotide sequence from the naturally occurring sequence of the promoter DNA sequence; or may be obtained through the use of PCR technology. See particularly, Mullis et al. (1987) Methods Enzymol. 155:335-350, and Erlich, ed. (1989) PCR Technology (Stockton Press, New York). Variants of these promoter fragments, such as those resulting from site-directed mutagenesis and a procedure such as DNA “shuffling”, are also encompassed by the compositions of the present invention.

An “analogue” of the regulatory elements of the present invention includes any substitution, deletion, or addition to the sequence of a regulatory element provided that said analogue maintains at least one regulatory property associated with the activity of the regulatory element of the present invention. Such properties include directing organ specificity, tissue specificity, or a combination thereof, or temporal activity, or developmental activity, or a combination thereof.

The term “variants” is intended to mean sequences having substantial similarity with a promoter sequence disclosed herein. For nucleotide sequences, naturally occurring variants such as these can be identified with the use of well-known molecular biology techniques, as, for example, with polymerase chain reaction (PCR) and hybridization techniques as outlined below. Variant nucleotide sequences also include synthetically derived nucleotide sequences, such as those generated, for example, by using site-directed mutagenesis. Generally, variants of a particular nucleotide sequence of the embodiments will have at least 40%, 50%, 60%, 65%, 70%, generally at least 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, to 95%, 96%, 97%, 98%, 99% or more sequence identity to that particular nucleotide sequence as determined by sequence alignment programs described elsewhere herein using default parameters. Biologically active variants are also encompassed by the present invention. Biologically active variants include, for example, the native promoter sequence of the embodiments having one or more nucleotide substitutions, deletions, or insertions. Promoter activity may be measured by using techniques such as Northern blot analysis, reporter activity measurements taken from transcriptional fusions, and the like. See, for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.), hereinafter “Sambrook,” herein incorporated by reference. Alternatively, levels of a reporter gene such as green fluorescent protein (GFP) or the like produced under the control of a promoter fragment or variant can be measured. See, for example, U.S. Pat. No. 6,072,050, herein incorporated by reference.
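As a rough illustration of what a percent sequence identity figure measures, the following Python sketch counts matching positions between two sequences that have already been aligned. It is not one of the sequence alignment programs referred to above: the function names, the gap character, and the choice of denominator (all alignment columns in which at least one sequence has a residue) are assumptions made for the example, and alignment programs may define identity differently.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a column with a
    gap in only one sequence counts as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no comparable positions in the alignment")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold=95.0):
    """True when the pair reaches the given percent identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    reference = "ACGTACGTAC"
    variant = "ACGTACGTAA"  # one substitution in ten positions
    print(percent_identity(reference, variant))
    print(meets_identity_threshold(reference, variant, threshold=90.0))
```

In practice the alignment itself would come from a dedicated alignment program run with its default parameters; this sketch covers only the final counting step.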

Methods for mutagenesis and nucleotide sequence alterations are well known in the art. See, for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing Company, New York) and the references cited therein.

Variant promoter nucleotide sequences also encompass sequences derived from a mutagenic and recombinogenic procedure such as DNA shuffling. With such a procedure, one or more different promoter sequences can be manipulated to create a new promoter possessing the desired properties. In this manner, libraries of recombinant polynucleotides are generated from a population of related sequence polynucleotides comprising sequence regions that have substantial sequence identity and can be homologously recombined in vitro or in vivo. Strategies for such DNA shuffling are known in the art. See, for example, Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer (1994) Nature 370:389-391; Crameri et al. (1997) Nature Biotech. 15:436-438; Moore et al. (1997) J. Mol. Biol. 272:336-347; Zhang et al. (1997) Proc. Natl. Acad. Sci. USA 94:4504-4509; Crameri et al. (1998) Nature 391:288-291; and U.S. Pat. Nos. 5,605,793 and 5,837,458.

The nucleotide sequences of the embodiments can be used to isolate corresponding sequences from other organisms, particularly other plants, for example, other monocots. In this manner, methods such as PCR, hybridization, and the like can be used to identify such sequences based on their sequence homology to the sequence set forth herein. Sequences isolated based on their sequence identity to the entire promoter sequence set forth herein or to fragments thereof are encompassed by the present invention. The promoter regions of the embodiments may be isolated from any plant, including, but not limited to, corn (Zea mays), Brassica (Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annuus), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), oats, safflower, barley, vegetables, ornamentals, and conifers.

In a PCR approach, oligonucleotide primers can be designed for use in PCR reactions to amplify corresponding DNA sequences from cDNA or genomic DNA extracted from any plant of interest. Methods for designing PCR primers and PCR cloning are generally known in the art and are disclosed in Sambrook, supra. See also Innis et al., eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press, New York); Innis and Gelfand, eds. (1995) PCR Strategies (Academic Press, New York); and Innis and Gelfand, eds. (1999) PCR Methods Manual (Academic Press, New York). Known methods of PCR include, but are not limited to, methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific primers, vector-specific primers, partially-mismatched primers, and the like.

In hybridization techniques, all or part of a known nucleotide sequence is used as a probe that selectively hybridizes to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as ³²P, or any other detectable marker. Thus, for example, probes for hybridization can be made by labeling synthetic oligonucleotides based on the promoter sequence of the embodiments. Methods for preparation of probes for hybridization and for construction of cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook, supra.

For example, the entire promoter sequence disclosed herein (SEQ ID NO: 1), or one or more portions thereof, may be used as a probe capable of specifically hybridizing to corresponding promoter sequences. To achieve specific hybridization under a variety of conditions, such probes include sequences that are unique among promoter sequences and are at least about 10 nucleotides in length, and generally at least about 20 nucleotides in length. Such probes may be used to amplify corresponding promoter sequences from a chosen plant by PCR. This technique may be used to isolate additional coding sequences from a desired plant or as a diagnostic assay to determine the presence of coding sequences in a plant. Hybridization techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, for example, Sambrook, supra).

Hybridization of such sequences may be carried out under stringent conditions. By “stringent conditions” or “stringent hybridization conditions” is intended conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). Stringent conditions are sequence-dependent and will be different in different circumstances. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides in length, often less than 500 nucleotides in length.

Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 37° C., and a wash in 1× to 2×SSC (20×SSC = 3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55° C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at 37° C., and a wash in 0.5× to 1×SSC at 55 to 60° C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.1×SSC at 60 to 65° C. for at least 30 minutes. Duration of hybridization is generally less than about 24 hours, usually about 4 to about 12 hours.
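As an illustration of the SSC dilutions named above, the following minimal Python sketch scales the stated 20×SSC composition (3.0 M NaCl/0.3 M trisodium citrate) to any wash strength. The function name ssc_composition is hypothetical, and the sodium ion total assumes complete dissociation with three sodium ions per trisodium citrate; neither is taken from the specification.

```python
# Composition of 20x SSC as stated in the text.
NACL_20X_MOLAR = 3.0      # M NaCl
CITRATE_20X_MOLAR = 0.3   # M trisodium citrate


def ssc_composition(fold):
    """Return (NaCl M, citrate M, total Na+ M) for a given SSC strength.

    `fold` is the strength of the wash, e.g. 0.1 for 0.1x SSC.
    The Na+ total assumes full dissociation (an assumption of this sketch).
    """
    if fold <= 0:
        raise ValueError("fold must be positive")
    scale = fold / 20.0
    nacl = NACL_20X_MOLAR * scale
    citrate = CITRATE_20X_MOLAR * scale
    sodium = nacl + 3 * citrate  # three Na+ per trisodium citrate
    return nacl, citrate, sodium


if __name__ == "__main__":
    for fold in (2.0, 1.0, 0.5, 0.1):
        nacl, citrate, sodium = ssc_composition(fold)
        print(f"{fold}x SSC: NaCl={nacl:.4f} M, citrate={citrate:.4f} M, Na+={sodium:.4f} M")
```

Under these assumptions, a 0.1×SSC high stringency wash contains roughly 0.015 M NaCl, which is why it is so much less permissive of mismatched hybrids than a 2×SSC wash.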

Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the thermal melting point (T_(m)) can be approximated from the equation of Meinkoth and Wahl (1984) Anal. Biochem. 138:267-284: T_(m) = 81.5° C. + 16.6 (log M) + 0.41 (% GC) − 0.61 (% form) − 500/L; where M is the molarity of monovalent cations, % GC is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The T_(m) is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. T_(m) is reduced by about 1° C. for each 1% of mismatching; thus, T_(m), hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with ≥90% identity are sought, the T_(m) can be decreased 10° C. Generally, stringent conditions are selected to be about 5° C. lower than the T_(m) for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4° C. lower than the T_(m); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10° C. lower than the T_(m); low stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15, or 20° C. lower than the T_(m). Using the equation, hybridization and wash compositions, and desired T_(m), those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a T_(m) of less than 45° C. (aqueous solution) or 32° C. (formamide solution), it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic Acid Probes, Part I, Chapter 2 (Elsevier, New York); and Ausubel et al., eds. (1995) Current Protocols in Molecular Biology, Chapter 2 (Greene Publishing and Wiley-Interscience, New York), hereinafter “Ausubel”. See also Sambrook, supra.
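The T_(m) approximation and the 1° C. per 1% mismatch adjustment described above can be worked through numerically. The following is a minimal Python sketch, not part of the original disclosure; the function names are hypothetical, and the logarithm is taken as base 10, which is the usual reading of the Meinkoth and Wahl equation.

```python
import math


def melting_temperature(na_molar, gc_percent, formamide_percent, length_bp):
    """Approximate T_m (deg C) of a DNA-DNA hybrid.

    T_m = 81.5 + 16.6*log10(M) + 0.41*(%GC) - 0.61*(%form) - 500/L
    """
    if na_molar <= 0 or length_bp <= 0:
        raise ValueError("molarity and hybrid length must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)


def mismatch_adjusted_tm(tm, min_identity_percent):
    """Lower T_m by about 1 deg C for each 1% of mismatching tolerated."""
    return tm - (100.0 - min_identity_percent)


if __name__ == "__main__":
    # Illustrative inputs only: 1 M Na+, 50% GC, 50% formamide, 500 bp hybrid.
    tm = melting_temperature(1.0, 50.0, 50.0, 500)
    print(f"T_m for a perfect match: {tm:.1f} C")
    print(f"T_m when >=90% identity is sought: {mismatch_adjusted_tm(tm, 90.0):.1f} C")
    print(f"Typical stringent wash (T_m - 5): {tm - 5:.1f} C")
```

With these illustrative inputs the terms are 81.5 + 0 + 20.5 − 30.5 − 1.0, giving a T_(m) of about 70.5° C. for a perfectly matched hybrid and about 60.5° C. when sequences with at least 90% identity are sought.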

Thus, isolated sequences that have green tissue-preferred promoter activity and which hybridize under stringent conditions to the promoter sequences disclosed herein, or to fragments thereof, are encompassed by the present invention.

In general, sequences that have promoter activity and hybridize to the promoter sequences disclosed herein will be at least 40% to 50% homologous, about 60% to 70% homologous, and even about 80%, 85%, 90%, 95% to 98% homologous or more with the disclosed sequences. That is, the sequence similarity of sequences may range, sharing at least about 40% to 50%, about 60% to 70%, and even about 80%, 85%, 90%, 95% to 98% sequence similarity.

The following terms are used to describe the sequence relationships between two or more nucleic acids or polynucleotides: (a) “reference sequence”, (b) “comparison window”, (c) “sequence identity”, (d) “percentage of sequence identity”, and (e) “substantial identity”.

(a) As used herein, “reference sequence” is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.

(b) As used herein, “comparison window” makes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.

Methods of alignment of sequences for comparison are well known in the art. Thus, the determination of percent sequence identity between any two sequences can be accomplished using a mathematical algorithm. Preferred, non-limiting examples of such mathematical algorithms are the algorithm of Myers and Miller (1988) CABIOS 4:11-17; the local homology algorithm of Smith et al. (1981) Adv. Appl. Math. 2:482; the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443-453; the search-for-similarity-method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. 85:2444-2448; the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877.

Computer implementations of these mathematical algorithms can be utilized for comparison of sequences to determine sequence identity. Such implementations include, but are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, Calif.); the ALIGN program (Version 2.0); the ALIGN PLUS program (Version 3.0, copyright 1997); and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package of Genetics Computer Group, Version 10 (available from Accelrys, 9685 Scranton Road, San Diego, Calif., 92121, USA). The scoring matrix used in Version 10 of the Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915).

Alignments using these programs can be performed using the default parameters. The CLUSTAL program is well described by Higgins et al. (1988) Gene 73:237-244; Higgins et al. (1989) CABIOS 5:151-153; Corpet et al. (1988) Nucleic Acids Res. 16:10881-90; Huang et al. (1992) CABIOS 8:155-65; and Pearson et al. (1994) Meth. Mol. Biol. 24:307-331. The ALIGN and the ALIGN PLUS programs are based on the algorithm of Myers and Miller (1988) supra. A PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used with the ALIGN program when comparing amino acid sequences. The BLAST programs of Altschul et al. (1990) J. Mol. Biol. 215:403 are based on the algorithm of Karlin and Altschul (1990) supra. BLAST nucleotide searches can be performed with the BLASTN program, score=100, wordlength=12, to obtain nucleotide sequences homologous to a nucleotide sequence encoding a protein of the embodiments. BLAST protein searches can be performed with the BLASTX program, score=50, wordlength=3, to obtain amino acid sequences homologous to a protein or polypeptide of the embodiments. To obtain gapped alignments for comparison purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25:3389. Alternatively, PSI-BLAST (in BLAST 2.0) can be used to perform an iterated search that detects distant relationships between molecules. See Altschul et al. (1997) supra. When utilizing BLAST, Gapped BLAST, or PSI-BLAST, the default parameters of the respective programs (e.g., BLASTN for nucleotide sequences, BLASTX for proteins) can be used, available on the web site for the National Center for Biotechnology Information. Alignment may also be performed manually by inspection.

Unless otherwise stated, sequence identity/similarity values provided herein refer to the value obtained using the GAP program with default parameters, or any equivalent program. By “equivalent program” is intended any sequence comparison program that, for any two sequences in question, generates an alignment having identical nucleotide or amino acid residue matches and an identical percent sequence identity when compared to the corresponding alignment generated by GAP.

The GAP program uses the algorithm of Needleman and Wunsch (1970) supra, to find the alignment of two complete sequences that maximizes the number of matches and minimizes the number of gaps. GAP considers all possible alignments and gap positions and creates the alignment with the largest number of matched bases and the fewest gaps. It allows for the provision of a gap creation penalty and a gap extension penalty in units of matched bases. GAP must make a profit of gap creation penalty number of matches for each gap it inserts. If a gap extension penalty greater than zero is chosen, GAP must, in addition, make a profit for each gap inserted of the length of the gap times the gap extension penalty. Default gap creation penalty values and gap extension penalty values in Version 10 of the Wisconsin Genetics Software Package for protein sequences are 8 and 2, respectively. For nucleotide sequences the default gap creation penalty is 50 while the default gap extension penalty is 3. The gap creation and gap extension penalties can be expressed as an integer selected from the group of integers consisting of from 0 to 200. Thus, for example, the gap creation and gap extension penalties can be 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65 or greater.
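The interplay of gap creation and gap extension penalties described above can be illustrated with a small global alignment scorer. The sketch below is not the GAP program: it is a generic Needleman-Wunsch style dynamic program with affine gap costs, in which a gap of length k costs the creation penalty plus k times the extension penalty. The function name, the match score of 1, the mismatch score of 0, and the small default penalties are assumptions chosen for readability, not GAP's scoring matrix or defaults.

```python
def global_alignment_score(a, b, match=1.0, mismatch=0.0,
                           gap_create=2.0, gap_extend=0.5):
    """Best global alignment score with affine gap costs.

    A gap of length k costs gap_create + k * gap_extend.
    """
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # best[i][j]: a[i-1] is aligned to b[j-1]
    # gap_b[i][j]: a[i-1] is aligned to a gap in b
    # gap_a[i][j]: b[j-1] is aligned to a gap in a
    best = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    gap_b = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    gap_a = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0
    for i in range(1, n + 1):
        gap_b[i][0] = -(gap_create + i * gap_extend)
    for j in range(1, m + 1):
        gap_a[0][j] = -(gap_create + j * gap_extend)

    open_cost = gap_create + gap_extend  # cost of the first gap position
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            best[i][j] = s + max(best[i - 1][j - 1],
                                 gap_b[i - 1][j - 1],
                                 gap_a[i - 1][j - 1])
            gap_b[i][j] = max(best[i - 1][j] - open_cost,
                              gap_b[i - 1][j] - gap_extend,
                              gap_a[i - 1][j] - open_cost)
            gap_a[i][j] = max(best[i][j - 1] - open_cost,
                              gap_a[i][j - 1] - gap_extend,
                              gap_b[i][j - 1] - open_cost)
    return max(best[n][m], gap_b[n][m], gap_a[n][m])


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACGT"))
    print(global_alignment_score("ACGT", "ACT"))
    print(global_alignment_score("AACCGG", "AAGG"))
```

In this sketch a gap is only worthwhile when the matches it enables exceed its cost, which mirrors the statement that GAP must make a profit for each gap it inserts. Raising gap_create toward a value such as 50 makes the scorer strongly prefer ungapped alignments wherever the sequence lengths allow.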

(c) As used herein, “sequence identity” or “identity” in the context of two nucleic acid or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. When sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have “sequence similarity” or “similarity”. Means for making this adjustment are well known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif.).

(d) As used herein, “percentage of sequence identity” means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity.
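The calculation in definition (d) can be expressed directly in code. The following minimal Python sketch assumes the two sequences have already been optimally aligned and padded to equal length with a gap character; the function name and the use of "-" for gaps are illustrative assumptions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of sequence identity over an aligned comparison window.

    Matched positions are divided by the total number of positions in the
    window (gap columns included), then multiplied by 100.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("comparison window must not be empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b)
                  if x == y and x != gap)
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Nine alignment columns: seven matches, one gap, one mismatch.
    print(round(percent_identity("ACGT-ACGT", "ACGTTACGA"), 2))
```

In the example, seven of nine window positions match, so the percentage of sequence identity is 7/9 × 100, or about 77.8%.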

(e)(i) The term “substantial identity” of polynucleotide sequences means that a polynucleotide comprises a sequence that has at least 70% sequence identity, at least 80%, at least 90%, and at least 95%, compared to a reference sequence using one of the alignment programs described using standard parameters. One of skill in the art will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning, and the like. Substantial identity of amino acid sequences for these purposes normally means sequence identity of at least 60%, 70%, 80%, 90%, or 95%.

Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Generally, stringent conditions are selected to be about 5° C. lower than the T_(m) for the specific sequence at a defined ionic strength and pH. However, stringent conditions encompass temperatures in the range of about 1° C. to about 20° C. lower than the T_(m), depending upon the desired degree of stringency as otherwise qualified herein. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides they encode are substantially identical. This may occur, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. One indication that two nucleic acid sequences are substantially identical is when the polypeptide encoded by the first nucleic acid is immunologically cross reactive with the polypeptide encoded by the second nucleic acid.

The promoter sequence disclosed herein, as well as variants and fragments thereof, are useful for genetic engineering of plants, e.g. for the production of a transformed or transgenic plant, to express a phenotype of interest. As used herein, the terms “transformed plant” and “transgenic plant” refer to a plant that comprises within its genome a heterologous polynucleotide. Generally, the heterologous polynucleotide is stably integrated within the genome of a transgenic or transformed plant such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant DNA construct or expression cassette. It is to be understood that as used herein the term “transgenic” includes any cell, cell line, callus, tissue, plant part, or plant the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic. The term “transgenic” as used herein does not encompass the alteration of the genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.

A transgenic “event” is produced by transformation of plant cells with a heterologous DNA construct, including a nucleic acid expression cassette that comprises a transgene of interest, the regeneration of a population of plants resulting from the insertion of the transgene into the genome of the plant, and selection of a particular plant characterized by insertion into a particular genome location. An event is characterized phenotypically by the expression of the transgene. At the genetic level, an event is part of the genetic makeup of a plant. The term “event” also refers to progeny produced by a sexual outcross between the transformant and another variety that include the heterologous DNA.

As used herein, the term “plant” includes reference to whole plants, plant organs (e.g., leaves, stems, roots, etc.), seeds, plant cells, and progeny of same. Parts of transgenic plants are to be understood within the scope of the embodiments to comprise, for example, plant cells, protoplasts, tissues, callus, embryos as well as flowers, stems, fruits, ovules, leaves, or roots originating in transgenic plants or their progeny previously transformed with a DNA molecule of the embodiments, and therefore consisting at least in part of transgenic cells, are also an object of the present invention.

As used herein, the term “plant cell” includes, without limitation, seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. The class of plants that can be used in the methods of the embodiments is generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledonous and dicotyledonous plants.

The promoter sequences and methods disclosed herein are useful in regulating expression of any heterologous nucleotide sequence in a host plant. Thus, the heterologous nucleotide sequence operably linked to the promoters disclosed herein may be a structural gene encoding a protein of interest. Genes of interest are reflective of the commercial markets and interests of those involved in the development of the crop. Crops and markets of interest change, and as developing nations open up world markets, new crops and technologies will emerge also. In addition, as our understanding of agronomic traits and characteristics such as yield and heterosis increases, the choice of genes for transformation will change accordingly. General categories of genes of interest for the present invention include, for example, those genes involved in information, such as zinc fingers, those involved in communication, such as kinases, and those involved in housekeeping, such as heat shock proteins. More specific categories of transgenes, for example, include genes encoding proteins conferring resistance to abiotic stress, such as drought, temperature, salinity, and toxins such as pesticides and herbicides, or to biotic stress, such as attacks by fungi, viruses, bacteria, insects, and nematodes, and development of diseases associated with these organisms. Various changes in phenotype are of interest including modifying expression of a gene in a plant's green tissues, altering a plant's pathogen or insect defense mechanism, increasing the plant's tolerance to herbicides, altering plant development to respond to environmental stress, and the like. The results can be achieved by providing expression of heterologous products or increased expression of endogenous products in plants. Alternatively, the results can be achieved by providing for a reduction of expression of one or more endogenous products, particularly enzymes, transporters, or cofactors, or affecting nutrient uptake in the plant. These changes result in a change in phenotype of the transformed plant.

It is recognized that any gene of interest can be operably linked to the promoter sequences of the embodiments and expressed in a plant's green tissues.

A DNA construct comprising one of these genes of interest can be used with transformation techniques, such as those described below, to create disease or insect resistance in susceptible plant phenotypes or to enhance disease or insect resistance in resistant plant phenotypes. Accordingly, the embodiments encompass methods that are directed to protecting plants against fungal pathogens, bacteria, viruses, nematodes, insects, and the like. By “disease resistance” or “insect resistance” is intended that the plants avoid the harmful symptoms that are the outcome of the plant-pathogen interactions.

Disease resistance and insect resistance genes such as lysozymes, cecropins, magainins, or thionins for antibacterial protection, or the pathogenesis-related (PR) proteins such as glucanases and chitinases for anti-fungal protection, or Bacillus thuringiensis endotoxins, protease inhibitors, collagenases, lectins, and glycosidases for controlling nematodes or insects are all examples of useful gene products.

Plant pathogens of interest include, but are not limited to, viruses or viroids, bacteria, insects, nematodes, fungi, and the like. Viruses include tobacco or cucumber mosaic virus, ringspot virus, necrosis virus, maize dwarf mosaic virus, etc. Nematodes include parasitic nematodes such as root knot, cyst, and lesion nematodes, etc.

Insect resistance genes may encode resistance to pests that have great yield drag such as rootworm, cutworm, European corn borer, and the like. Such genes include, for example, Bacillus thuringiensis toxic protein genes (U.S. Pat. Nos. 5,366,892; 5,747,450; 5,737,514; 5,723,756; 5,593,881; and Geiser et al. (1986) Gene 48:109); lectins (Van Damme et al. (1994) Plant Mol. Biol. 24:825); and the like.

Genes encoding disease resistance traits include detoxification genes, such as against fumonisin (U.S. Pat. No. 5,792,931); avirulence (avr) and disease resistance (R) genes (Jones et al. (1994) Science 266:789; Martin et al. (1993) Science 262:1432; Mindrinos et al. (1994) Cell 78:1089); and the like.

Herbicide resistance traits may be introduced into plants by genes coding for resistance to herbicides that act to inhibit the action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene containing mutations leading to such resistance, in particular the S4 and/or Hra mutations), genes coding for resistance to herbicides that act to inhibit action of glutamine synthase, such as phosphinothricin or basta (e.g., the bar gene), or other such genes known in the art. The bar gene encodes resistance to the herbicide basta, the nptII gene encodes resistance to the antibiotics kanamycin and geneticin, and the ALS gene encodes resistance to the herbicide chlorsulfuron.

Glyphosate resistance is imparted by mutant 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) and aroA genes. See, for example, U.S. Pat. No. 4,940,835, which discloses the nucleotide sequence of a form of EPSPS which can confer glyphosate resistance. U.S. Pat. No. 5,627,061 also describes genes encoding EPSPS enzymes. See also U.S. Pat. Nos. 6,248,876; 6,040,497; 5,804,425; 5,633,435; 5,145,783; 4,971,908; 5,312,910; 5,188,642; 4,940,835; 5,866,775; 6,225,114; 6,130,366; 5,310,667; 4,535,060; 4,769,061; 5,633,448; 5,510,471; RE 36,449; RE 37,287; and U.S. Pat. No. 5,491,288; and international publications WO 97/04103; WO 97/04114; WO 00/66746; WO 01/66704; WO 00/66747 and WO 00/66748, which are incorporated herein by reference for this purpose. Glyphosate resistance is also imparted to plants that express a gene that encodes a glyphosate oxido-reductase enzyme as described more fully in U.S. Pat. Nos. 5,776,760 and 5,463,175, which are incorporated herein by reference for this purpose. In addition, glyphosate resistance can be imparted to plants by the over-expression of genes encoding glyphosate N-acetyltransferase. See, for example, U.S. patent application Ser. No. 10/004,357 (now abandoned); U.S. Pat. Nos. 7,462,481 and 7,405,074.

Sterility genes can also be encoded in an expression cassette and provide an alternative to physical detasseling. Examples of genes used in such ways include male tissue-preferred genes and genes with male sterility phenotypes such as QM, described in U.S. Pat. No. 5,583,210. Other genes include kinases and those encoding compounds toxic to either male or female gametophytic development.

Commercial traits can also be encoded on a gene or genes that could increase, for example, starch for ethanol production, or provide expression of proteins. Another important commercial use of transformed plants is the production of polymers and bioplastics such as described in U.S. Pat. No. 5,602,321. Genes such as β-ketothiolase, PHBase (polyhydroxybutyrate synthase), and acetoacetyl-CoA reductase (see Schubert et al. (1988) J. Bacteriol. 170:5837-5847) facilitate expression of polyhydroxyalkanoates (PHAs).

Agronomically important traits that affect quality of grain, such as levels and types of oils, saturated and unsaturated, quality and quantity of essential amino acids, levels of cellulose, starch, and protein content can be genetically altered using the methods of the present invention. Modifications include increasing content of oleic acid, saturated and unsaturated oils, increasing levels of lysine and sulfur, providing essential amino acids, and modifying starch. Hordothionin protein modifications in corn are described in U.S. Pat. Nos. 5,990,389; 5,885,801; 5,885,802 and 5,703,049; herein incorporated by reference. Another example is lysine and/or sulfur rich seed protein encoded by the soybean 2S albumin described in U.S. Pat. No. 5,850,016, filed Mar. 20, 1996, and the chymotrypsin inhibitor from barley, Williamson et al. (1987) Eur. J. Biochem. 165:99-106, the disclosures of which are herein incorporated by reference.

Exogenous products include plant enzymes and products as well as those from other sources including prokaryotes and other eukaryotes. Such products include enzymes, cofactors, hormones, and the like.

Examples of other applicable genes and their associated phenotypes include the gene that encodes viral coat protein and/or RNA, or other viral or plant genes that confer viral resistance; genes that confer fungal resistance; genes that confer insect resistance; genes that promote yield improvement; and genes that provide for resistance to stress, such as dehydration resulting from heat and salinity, toxic metal or trace elements, or the like.

“RNAi” refers to a series of related techniques to reduce the expression of genes (see, for example, U.S. Pat. No. 6,506,559). Older techniques referred to by other names are now thought to rely on the same mechanism, but are given different names in the literature. These include “antisense inhibition,” the production of antisense RNA transcripts capable of suppressing the expression of the target protein, and “co-suppression” or “sense-suppression,” which refer to the production of sense RNA transcripts capable of suppressing the expression of identical or substantially similar foreign or endogenous genes (U.S. Pat. No. 5,231,020, incorporated herein by reference). Such techniques rely on the use of constructs resulting in the accumulation of double stranded RNA with one strand complementary to the target gene to be silenced. The promoter sequence of the embodiments, and its related biologically active fragments or variants disclosed herein, may be used to drive expression of constructs that will result in RNA interference including microRNAs and siRNAs.

The heterologous nucleotide sequence operably linked to the promoter and its related biologically active fragments or variants disclosed herein may be an antisense sequence for a targeted gene. The terminology “antisense DNA nucleotide sequence” is intended to mean a sequence that is in inverse orientation to the 5′-to-3′ normal orientation of that nucleotide sequence. When delivered into a plant cell, expression of the antisense DNA sequence prevents normal expression of the DNA nucleotide sequence for the targeted gene. The antisense nucleotide sequence encodes an RNA transcript that is complementary to and capable of hybridizing to the endogenous messenger RNA (mRNA) produced by transcription of the DNA nucleotide sequence for the targeted gene. In this case, production of the native protein encoded by the targeted gene is inhibited to achieve a desired phenotypic response. Modifications of the antisense sequences may be made as long as the sequences hybridize to and interfere with expression of the corresponding mRNA. In this manner, antisense constructions having, for example, 70%, 80%, or 85% sequence identity to the corresponding antisense sequences may be used. Furthermore, portions of the antisense nucleotides may be used to disrupt the expression of the target gene. Generally, sequences of at least 50 nucleotides, 100 nucleotides, 200 nucleotides, or greater may be used. Thus, the promoter sequences disclosed herein may be operably linked to antisense DNA sequences to reduce or inhibit expression of a native protein in the plant.
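The complementarity relationship underlying an antisense construct can be shown with a short sequence manipulation. In this minimal Python sketch, the antisense insert is modeled as the reverse complement of the sense (coding) strand, so that the transcript read from the insert is complementary to the endogenous mRNA. The function names and the six-nucleotide example sequence are hypothetical and are not sequences of the embodiments.

```python
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def as_rna(dna_coding_strand):
    """RNA with the same sequence as the given coding strand (T replaced by U)."""
    return dna_coding_strand.upper().replace("T", "U")


if __name__ == "__main__":
    sense = "ATGGCC"                          # hypothetical target fragment
    antisense_insert = reverse_complement(sense)
    print("antisense insert:   ", antisense_insert)
    print("endogenous mRNA:    ", as_rna(sense))
    print("antisense transcript:", as_rna(antisense_insert))
```

Read antiparallel, each base of the antisense transcript pairs with a base of the endogenous mRNA, which is the property the paragraph above relies on for hybridization.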

In one embodiment, DNA constructs will comprise a transcriptional initiation region comprising one of the promoter nucleotide sequences disclosed herein, or variants or fragments thereof, operably linked to a heterologous nucleotide sequence whose expression is to be controlled by the promoter of the embodiments. Such a DNA construct is provided with a plurality of restriction sites for insertion of the nucleotide sequence to be under the transcriptional regulation of the regulatory regions. The DNA construct may additionally contain selectable marker genes.

The DNA construct will include in the 5′-3′ direction of transcription, a transcriptional initiation region (i.e., a green tissue-preferred promoter of the embodiments), translational initiation region, a heterologous nucleotide sequence of interest, a translational termination region and, optionally, a transcriptional termination region functional in the host organism. The regulatory regions (i.e., promoters, transcriptional regulatory regions, and translational termination regions) and/or the polynucleotide of the embodiments may be native/analogous to the host cell or to each other. Alternatively, the regulatory regions and/or the polynucleotide of the embodiments may be heterologous to the host cell or to each other. As used herein, “heterologous” in reference to a sequence is a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous polynucleotide is from a species different from the species from which the polynucleotide was derived, or, if from the same/analogous species, one or both are substantially modified from their original form and/or genomic locus, or the promoter is not the native promoter for the operably linked polynucleotide.

The optionally included termination region may be native with the transcriptional initiation region, may be native with the operably linked polynucleotide of interest, may be native with the plant host, or may be derived from another source (i.e., foreign or heterologous) to the promoter, the polynucleotide of interest, the host, or any combination thereof. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also Guerineau et al. (1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; and Joshi et al. (1987) Nucleic Acids Res. 15:9627-9639. In particular embodiments, the potato protease inhibitor II gene (PinII) terminator is used. See, for example, Keil et al. (1986) Nucl. Acids Res. 14:5641-5650; and An et al. (1989) Plant Cell 1:115-122, herein incorporated by reference in their entirety.

The DNA construct comprising a promoter sequence of the present invention operably linked to a heterologous nucleotide sequence may also contain at least one additional nucleotide sequence for a gene to be cotransformed into the organism. Alternatively, the additional sequence(s) can be provided on another DNA construct.

Where appropriate, the heterologous nucleotide sequence whose expression is to be under the control of the inducible promoter sequence of the present invention and any additional nucleotide sequence(s) may be optimized for increased expression in the transformed plant. That is, these nucleotide sequences can be synthesized using plant-preferred codons for improved expression. Methods are available in the art for synthesizing plant-preferred nucleotide sequences. See, for example, U.S. Pat. Nos. 5,380,831 and 5,436,391, and Murray et al. (1989) Nucleic Acids Res. 17:477-498, herein incorporated by reference.

Additional sequence modifications are known to enhance gene expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and other such well-characterized sequences that may be deleterious to gene expression. The G-C content of the heterologous nucleotide sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
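The comparison of G-C content against host genes described above can be illustrated with a short calculation. The following is a minimal sketch only; the function names and the idea of averaging per-gene G-C fractions are assumptions made for illustration and are not taken from the specification.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def gc_gap(candidate: str, host_genes: list[str]) -> float:
    """Difference between the candidate's G-C fraction and the mean G-C
    fraction of a set of reference genes expressed in the host.

    Assumption: the host average is taken as the unweighted mean of the
    per-gene G-C fractions. A positive result means the candidate is more
    G-C rich than the host reference set.
    """
    if not host_genes:
        raise ValueError("no host reference genes supplied")
    host_mean = sum(gc_fraction(g) for g in host_genes) / len(host_genes)
    return gc_fraction(candidate) - host_mean
```

A sequence whose gap is far from zero would be a candidate for resynthesis with adjusted codon usage.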

The DNA constructs may additionally contain 5′ leader sequences. Such leader sequences can act to enhance translation. Translation leaders are known in the art and include: picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5′ noncoding region) (Elroy-Stein et al. (1989) Proc. Nat. Acad. Sci. USA 86:6126-6130); potyvirus leaders, for example, TEV leader (Tobacco Etch Virus) (Allison et al. (1986) Virology 154:9-20); MDMV leader (Maize Dwarf Mosaic Virus); human immunoglobulin heavy-chain binding protein (BiP) (Macejak et al. (1991) Nature 353:90-94); untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4) (Jobling et al. (1987) Nature 325:622-625); tobacco mosaic virus leader (TMV) (Gallie et al. (1989) Molecular Biology of RNA, pages 237-256); and maize chlorotic mottle virus leader (MCMV) (Lommel et al. (1991) Virology 81:382-385). See also Della-Cioppa et al. (1987) Plant Physiology 84:965-968. Other methods known to enhance translation and/or mRNA stability can also be utilized, for example, introns, such as the maize Ubiquitin intron (Christensen and Quail (1996) Transgenic Res. 5:213-218; Christensen et al. (1992) Plant Molecular Biology 18:675-689) or the maize Adh1 intron (Kyozuka et al. (1991) Mol. Gen. Genet. 228:40-48; Kyozuka et al. (1990) Maydica 35:353-357), and the like.

The DNA constructs of the present invention can also include further enhancers, either translation or transcription enhancers, as may be required. These enhancer regions are well known to persons skilled in the art, and can include the ATG initiation codon and adjacent sequences. The initiation codon must be in phase with the reading frame of the coding sequence to ensure translation of the entire sequence. The translation control signals and initiation codons can be from a variety of origins, both natural and synthetic. Translational initiation regions may be provided from the source of the transcriptional initiation region, or from the structural gene. The sequence can also be derived from the regulatory element selected to express the gene, and can be specifically modified so as to increase translation of the mRNA. It is recognized that to increase transcription levels enhancers may be utilized in combination with the promoter regions of the embodiments. Enhancers are known in the art and include the SV40 enhancer region, the 35S enhancer element, and the like.
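The requirement that the initiation codon be in phase with the coding sequence can be checked mechanically. The sketch below is illustrative only: the function name, the 0-based indexing, and the extra check for an intervening stop codon are assumptions and do not describe any procedure used in the Examples.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def atg_in_phase(seq: str, atg_index: int, cds_start: int) -> bool:
    """Return True if the ATG at atg_index is in the same reading frame as
    the coding sequence starting at cds_start (both 0-based), with no stop
    codon between them.

    Assumption: seq is the sense strand, written 5' to 3'.
    """
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    # In phase means the distance between the two positions is a multiple of 3.
    if (cds_start - atg_index) % 3 != 0:
        return False
    # Walk codon by codon from the ATG up to the coding sequence.
    for i in range(atg_index, cds_start, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return False
    return True
```

For a hypothetical fragment such as "CCATGGCTGCAAAATTT", with the ATG at index 2, a coding sequence beginning at index 8 is in phase, whereas one beginning at index 9 is not.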

In preparing the DNA construct, the various DNA fragments may be manipulated, so as to provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites. Restriction sites may be added or removed, superfluous DNA may be removed, or other modifications of the like may be made to the sequences of the embodiments. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, re-substitutions, for example, transitions and transversions, may be involved.

Reporter genes or selectable marker genes may be included in the DNA constructs. Examples of suitable reporter genes known in the art can be found in, for example, Jefferson et al. (1991) in Plant Molecular Biology Manual, ed. Gelvin et al. (Kluwer Academic Publishers), pp. 1-33; DeWet et al. (1987) Mol. Cell. Biol. 7:725-737; Goff et al. (1990) EMBO J. 9:2517-2522; Kain et al. (1995) BioTechniques 19:650-655; and Chiu et al. (1996) Current Biology 6:325-330.

Selectable marker genes for selection of transformed cells or tissues can include genes that confer antibiotic resistance or resistance to herbicides. Examples of suitable selectable marker genes include, but are not limited to, genes encoding resistance to chloramphenicol (Herrera Estrella et al. (1983) EMBO J. 2:987-992); methotrexate (Herrera Estrella et al. (1983) Nature 303:209-213; Meijer et al. (1991) Plant Mol. Biol. 16:807-820); hygromycin (Waldron et al. (1985) Plant Mol. Biol. 5:103-108; Zhijian et al. (1995) Plant Science 108:219-227); streptomycin (Jones et al. (1987) Mol. Gen. Genet. 210:86-91); spectinomycin (Bretagne-Sagnard et al. (1996) Transgenic Res. 5:131-137); bleomycin (Hille et al. (1990) Plant Mol. Biol. 7:171-176); sulfonamide (Guerineau et al. (1990) Plant Mol. Biol. 15:127-136); bromoxynil (Stalker et al. (1988) Science 242:419-423); glyphosate (Shaw et al. (1986) Science 233:478-481); and phosphinothricin (DeBlock et al. (1987) EMBO J. 6:2513-2518).

Other genes that could serve utility in the recovery of transgenic events but might not be required in the final product would include, but are not limited to, examples such as GUS (β-glucuronidase; Jefferson (1987) Plant Mol. Biol. Rep. 5:387), GFP (green fluorescent protein; Chalfie et al. (1994) Science 263:802), luciferase (Riggs et al. (1987) Nucleic Acids Res. 15(19):8115 and Luehrsen et al. (1992) Methods Enzymol. 216:397-414), and the maize genes for anthocyanin production (Ludwig et al. (1990) Science 247:449).

The nucleic acid molecules of the present invention are useful in methods directed to expressing a nucleotide sequence in a plant. This may be accomplished by transforming a plant cell of interest with a DNA construct comprising a promoter identified herein, operably linked to a heterologous nucleotide sequence, and regenerating a stably transformed plant from said plant cell. The methods of the embodiments are also directed to inducibly expressing a nucleotide sequence in a plant. Those methods comprise transforming a plant cell with a DNA construct comprising a promoter identified herein that initiates transcription in a plant cell in an inducible manner, operably linked to a heterologous nucleotide sequence, regenerating a transformed plant from said plant cell, and subjecting the plant to the required stimulus to induce expression.

The DNA construct comprising the particular promoter sequence of the present invention operably linked to a nucleotide sequence of interest can be used to transform any plant. In this manner, genetically modified, i.e. transgenic or transformed, plants, plant cells, plant tissue, seed, root, and the like can be obtained.

Plant species suitable for the embodiments include, but are not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum and Johnson grass (Sorghum bicolor, Sorghum vulgare, Sorghum halepense), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, barley, onion, date, vegetables, ornamentals, grasses (such as, but not limited to switchgrass (Panicum virgatum), reed canary grass (Phalaris arundinacea), Miscanthus grasses (such as M. sinensis, M. sacchariflorus (Amur silver-grass), M. giganteus), purple false brome (Brachypodium distachyon), and giant reedgrass (Arundo donax)), and conifers.

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). Ornamentals include azalea (Rhododendron spp.), hydrangea (Hydrangea macrophylla), hibiscus (Hibiscus rosa-sinensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum.

Conifers that may be employed in practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliottii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata); Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow cedar (Chamaecyparis nootkatensis).

As used herein, “vector” refers to a DNA molecule such as a plasmid, cosmid, or bacterial phage for introducing a nucleotide construct, for example, a DNA construct, into a host cell. Cloning vectors typically contain one or a small number of restriction endonuclease recognition sites at which foreign DNA sequences can be inserted in a determinable fashion without loss of essential biological function of the vector, as well as a marker gene that is suitable for use in the identification and selection of cells transformed with the cloning vector. Marker genes typically include genes that provide tetracycline resistance, hygromycin resistance, or ampicillin resistance.

The methods of the embodiments involve introducing a nucleotide construct into a plant. By “introducing” is intended presenting to the plant the nucleotide construct in such a manner that the construct gains access to the interior of a cell of the plant. The methods of the embodiments do not depend on a particular method for introducing a nucleotide construct to a plant, only that the nucleotide construct gains access to the interior of at least one cell of the plant. Methods for introducing nucleotide constructs into plants are known in the art including, but not limited to, stable transformation methods, transient transformation methods, and virus-mediated methods.

By “stable transformation” is intended that the nucleotide construct introduced into a plant integrates into the genome of the plant and is capable of being inherited by progeny thereof. By “transient transformation” is intended that a nucleotide construct introduced into a plant does not integrate into the genome of the plant.

The nucleotide constructs of the embodiments may be introduced into plants by contacting plants with a virus or viral nucleic acids. Generally, such methods involve incorporating a nucleotide construct of the embodiments within a viral DNA or RNA molecule. Methods for introducing nucleotide constructs into plants and expressing a protein encoded therein, involving viral DNA or RNA molecules, are known in the art. See, for example, U.S. Pat. Nos. 5,889,191, 5,889,190, 5,866,785, 5,589,367, and 5,316,931; herein incorporated by reference.

Transformation protocols as well as protocols for introducing nucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing nucleotide sequences into plant cells and subsequent insertion into the plant genome include microinjection (Crossway et al. (1986) Biotechniques 4:320-334), electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606), Agrobacterium-mediated transformation (U.S. Pat. Nos. 5,981,840 and 5,563,055), direct gene transfer (Paszkowski et al. (1984) EMBO J. 3:2717-2722), and ballistic particle acceleration (see, for example, U.S. Pat. Nos. 4,945,050; 5,879,918; 5,886,244; 5,932,782; Tomes et al. (1995) in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin); and McCabe et al. (1988) Biotechnology 6:923-926). Also see Weissinger et al. (1988) Ann. Rev. Genet. 22:421-477; Sanford et al. (1987) Particulate Science and Technology 5:27-37 (onion); Christou et al. (1988) Plant Physiol. 87:671-674 (soybean); McCabe et al. (1988) Bio/Technology 6:923-926 (soybean); Finer and McMullen (1991) In Vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh et al. (1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); U.S. Pat. Nos. 5,240,855; 5,322,783 and 5,324,646; Klein et al. (1988) Plant Physiol. 91:440-444 (maize); Fromm et al. (1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren et al. (1984) Nature (London) 311:763-764; U.S. Pat. No. 5,736,369 (cereals); Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation of Ovule Tissues, ed. Chapman et al. (Longman, New York), pp. 197-209 (pollen); Kaeppler et al. (1990) Plant Cell Reports 9:415-418 and Kaeppler et al. (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 12:250-255 and Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda et al. (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference.

The cells that have been transformed may be grown into plants in accordance with conventional ways. See, for example, McCormick et al. (1986) Plant Cell Reports 5:81-84. These plants may then be grown, and either pollinated with the same transformed strain or different strains, and the resulting hybrid having inducible expression of the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that inducible expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure inducible expression of the desired phenotypic characteristic has been achieved. Thus as used herein, “transformed seeds” refers to seeds that contain the nucleotide construct stably integrated into the plant genome.

There are a variety of methods for the regeneration of plants from plant tissue. The particular method of regeneration will depend on the starting plant tissue and the particular plant species to be regenerated. The regeneration, development and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art (Weissbach and Weissbach, (1988) In: Methods for Plant Molecular Biology, (Eds.), Academic Press, Inc., San Diego, Calif.). This regeneration and growth process typically includes the steps of selection of transformed cells and culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.

The embodiments provide compositions for screening compounds that modulate expression within plants. The vectors, cells, and plants can be used for screening candidate molecules for agonists and antagonists of the promoter disclosed herein. For example, a reporter gene can be operably linked to a promoter and expressed as a transgene in a plant. Compounds to be tested are added and reporter gene expression is measured to determine the effect on promoter activity.

The following examples are offered by way of illustration and not by way of limitation.

Experimental

The present invention is further defined in the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. Techniques in molecular biology were typically performed as described in Ausubel or Sambrook, supra. It should be understood that these Examples, while indicating certain embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the embodiments to adapt it to various usages and conditions. Thus, various modifications of the embodiments in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.

The disclosure of each reference set forth herein is incorporated herein by reference in its entirety.

EXAMPLE 1 Expression Pattern of the PsaH Gene

Evidence that the promoter of SEQ ID NO: 1 was a green tissue-preferred promoter was obtained using Lynx Massively Parallel Signature Sequencing technology (MPSS) (see Brenner S, et al. (2000) Nature Biotechnology 18:630-634, Brenner S et al. (2000) Proc Natl Acad Sci USA 97:1665-1670). This technology involves the generation of 17 base signature tags from mRNA samples that have been reverse transcribed. The tags are simultaneously sequenced and assigned to genes or ESTs. The abundance of these tags is given a number value that is normalized to parts per million (PPM) which then allows the tag expression, or tag abundance, to be compared across different tissues. Thus, the MPSS platform can be used to determine the expression pattern of a particular gene and its expression level in different tissues.
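The normalization of tag abundance to parts per million can be written out as a short calculation. This is a minimal sketch, assuming that PPM is the raw count of a tag divided by the total tag count of its library and multiplied by one million; the function name and the tag labels in the usage note are hypothetical and the MPSS platform's own processing may differ.

```python
def tags_to_ppm(tag_counts: dict[str, int]) -> dict[str, float]:
    """Convert raw signature-tag counts from one library (one tissue sample)
    into parts per million of that library's total tag count."""
    total = sum(tag_counts.values())
    if total == 0:
        raise ValueError("library contains no tags")
    return {tag: count * 1_000_000 / total for tag, count in tag_counts.items()}
```

Because each library is scaled by its own total, a hypothetical leaf library and a hypothetical root library of different sequencing depth yield values that can be compared directly for the same tag.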

During a search of the MPSS database, the gene associated with the promoter of SEQ ID NO: 1 was found to be expressed primarily in leaf and whorl tissues and secondarily in husk, silk, tassel and stalk tissues (FIG. 1). The high levels of expression in leaf tissue suggested that the promoter could be a suitable candidate to drive transgene expression. Such transgenes include insecticidal genes, biotic and abiotic stress-resistance genes (drought, salt, cold, etc.), and agronomic trait genes.

EXAMPLE 2 Isolation of the PsaH Promoter

The tag generated from MPSS identified an EST from a proprietary database. An alignment of this EST with other overlapping ESTs revealed a consensus sequence. A query against the GenBank database revealed homology to a Zea mays photosystem I complex PsaH subunit precursor.

Sequence upstream of the consensus sequence was identified in a search against the Genome Sequence Survey (GSS) database of NCBI. Alignment of GSS and EST sequences was done to delineate the ORF and 5′ flanking region. The region upstream of the most 5′ ORF was selected as the promoter-containing region. Oligonucleotides were designed to PCR amplify approximately 1442 bp of genomic DNA from the maize inbred line, B73. Restriction endonuclease enzyme sites were introduced during PCR to aid in cloning: XbaI at the 5′ end and BamHI at the 3′ end. The 5′ (forward) and 3′ (reverse) oligonucleotide primers are set forth in SEQ ID NO: 2 and SEQ ID NO: 3, respectively. PCR was performed using PLATINUM Pfx DNA polymerase (Invitrogen) as directed by the manufacturer using 100 ng template and 1.25 Units enzyme in a 50 μl reaction volume, with the modification that the final buffer concentration was 2× instead of 1×. PCR conditions were as follows: 3-step cycling process: 94° C. for 5 minutes; 35 cycles at 94° C. for 15 seconds, 60° C. for 45 seconds, and 68° C. for 2 minutes; 68° C. for 10 minutes; hold at 4° C. The resulting fragment was isolated and extracted from a 1% agarose gel using a Qiagen gel extraction kit and cloned into pCR-Blunt (Invitrogen), as directed. Clones containing the 1442 bp promoter fragment were identified by isolating the plasmid DNA from antibiotic resistant colonies and performing restriction endonuclease digests on the DNA. Positive clones were sequenced using M13F and M13R primers, as well as oligos as set forth in SEQ ID NOs: 4, 5, 6, and 7. The 1442 bp DNA fragment was then cloned into a plant expression vector in front of the β-glucuronidase coding region for introduction into plant cells.
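The programmed duration of the cycling conditions given above can be totaled with a short calculation. The sketch below is illustrative only; the function name is hypothetical, and temperature ramp times and the final 4° C. hold are deliberately excluded, so an actual run takes longer.

```python
def pcr_run_seconds(initial_denature_s: int,
                    cycles: int,
                    step_seconds: tuple[int, ...],
                    final_extension_s: int) -> int:
    """Total programmed time of a PCR run in seconds.

    Assumption: only the programmed hold times are counted; thermal ramping
    between steps and the terminal hold are ignored.
    """
    return initial_denature_s + cycles * sum(step_seconds) + final_extension_s


# Conditions stated in this Example: 94 C for 5 min; 35 cycles of
# 15 s / 45 s / 2 min; 68 C for 10 min.
total = pcr_run_seconds(5 * 60, 35, (15, 45, 120), 10 * 60)
print(total, "seconds =", total / 60, "minutes")  # 7200 seconds = 120.0 minutes
```

The 2 minute extension step for an approximately 1442 bp product corresponds to roughly 5 seconds per 60 bp, which can be adjusted in the same function if a different fragment length is amplified.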

In silico analysis of the sequence for motifs important in expression revealed several gccac and gggcc pentamers. These sequences have been identified as overrepresented sequences in phyA-induced promoters of Arabidopsis (Hudson M E & Quail P H. (2003) Plant Physiol. 133(4):1605-16). Their function in expression is unknown. A canonical TATA box was not identified in the PsaH sequence, although most maize promoters lack a discernible TATA box.
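A pentamer search of the kind described above can be sketched in a few lines. This is an illustration under stated assumptions, not the analysis actually performed: the specification does not say which tool was used or whether both strands were searched, and the function names and the test sequence in the usage note are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def count_motif(seq: str, motif: str, both_strands: bool = True) -> int:
    """Count occurrences of a short motif, allowing overlaps.

    Assumption: searching both strands is appropriate for promoter motifs.
    A palindromic motif would be counted once per strand; gccac and gggcc
    are not palindromic, so this does not affect them.
    """
    seq, motif = seq.upper(), motif.upper()

    def count_one(strand: str) -> int:
        width = len(motif)
        return sum(1 for i in range(len(strand) - width + 1)
                   if strand[i:i + width] == motif)

    total = count_one(seq)
    if both_strands:
        total += count_one(reverse_complement(seq))
    return total
```

For a hypothetical fragment "AAGCCACTTGGGCC", each of the two pentamers is found once; the promoter of SEQ ID NO: 1 would be passed in the same way.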

EXAMPLE 3 Promoter Activity of PsaH

To demonstrate that the DNA isolated as the PsaH promoter functions as a promoter, transient particle bombardment assays were performed. These assays provided a rapid assessment of whether a DNA fragment is able to direct gene expression.

The isolated DNA was PCR amplified from genomic DNA and cloned into an expression vector upstream of the β-glucuronidase (GUS) gene, with and without the Adh1 intron 1. The Adh1 intron was included for the purpose of determining the promoter-intron effect on expression levels, as some 5′ proximal introns, such as the Adh1 intron 1, have been shown to enhance the expression of foreign genes in cereal cells through a process called intron-mediated enhancement (Callis et al. (1987) Genes Dev. 11: 1183-1200; Kyozuka et al. (1990) Maydica 35:353-357).

Biolistic bombardment of 3-day-old maize seedlings with the PsaH:Adh1 expression cassette resulted in numerous GUS staining foci on the coleoptile (>30 foci/coleoptile). The expression cassette without the Adh1 intron resulted in no GUS staining foci. The level of GUS staining for the PsaH:Adh1 cassette was comparable to, but below, that of a control cassette that consisted of the strong, constitutive promoter, Ubi-1, directing GUS expression.

Materials and Methods Utilized for the Biolistic Transient Expression Assay

B73 seeds were placed along one edge of a piece of germination paper that had been soaked in a solution of 7% sucrose. An additional piece of germination paper, identical in size to the first, was also soaked in 7% sucrose and was used to overlay the kernels. The germination paper-kernel-germination paper sandwich was subsequently rolled and placed into a beaker of 7% sucrose solution, such that the solution would wick up the paper to the kernels at the top of the roll. This allowed for straight root growth. Kernels were permitted to germinate and develop for 2-3 days in the dark at 27-28° C. prior to bombardment. The sheath covering the coleoptile was removed and the seedlings were placed in a sterile petri dish (60 mm) on a layer of filter paper moistened with distilled water. Two seedlings per plate were arranged in opposite orientations and anchored to the filter paper with a 0.5% agarose solution.

DNA/gold particle mixtures were prepared for bombardment by the following method: 60 mg of 0.6-1.0 micron gold particles were pre-washed with ethanol, rinsed with sterile distilled H₂O, and resuspended in a total of 1 mL of sterile H₂O. DNA was precipitated onto the surface of the gold particles by combining, in the following order, 50 μL of pre-washed 0.6 μm gold particles, 5-10 μg of test DNA, 50 μL 2.5 M CaCl₂ and 25 μL of 0.1 M spermidine. The solution was immediately vortexed for 3 minutes and centrifuged briefly to pellet the DNA/gold particles. The DNA/gold was washed once with 500 μL of 100% ethanol and suspended in a final volume of 50 μL of 100% ethanol. The DNA/gold solution was incubated at −20° C. for at least 60 minutes prior to applying 6 μL of the DNA/gold mixture onto each MYLAR macrocarrier.
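The amounts of gold and DNA loaded onto each macrocarrier follow from the volumes stated above. The calculation below is a sketch only; the function name is hypothetical, and it assumes that all of the gold and all of the DNA are recovered through the precipitation and ethanol wash, which will overestimate the true load.

```python
def per_shot_load(gold_stock_mg_per_ml: float,
                  gold_volume_ul: float,
                  dna_ug: float,
                  final_volume_ul: float,
                  load_volume_ul: float) -> tuple[float, float]:
    """Return (gold in ug, DNA in ug) applied to one macrocarrier.

    Assumption: no losses during precipitation or washing.
    Note that 1 mg/mL is the same as 1 ug/uL.
    """
    gold_ug = gold_stock_mg_per_ml * gold_volume_ul
    gold_per_shot = gold_ug * load_volume_ul / final_volume_ul
    dna_per_shot = dna_ug * load_volume_ul / final_volume_ul
    return gold_per_shot, dna_per_shot


# Values from the text: 60 mg gold in 1 mL, 50 uL of that suspension,
# 5 ug DNA (lower bound), 50 uL final volume, 6 uL per macrocarrier.
gold, dna = per_shot_load(60.0, 50.0, 5.0, 50.0, 6.0)
print(f"{gold:.0f} ug gold and {dna:.1f} ug DNA per macrocarrier")
```

With 10 μg of test DNA, the upper bound stated in the text, the same function gives twice the DNA per macrocarrier at the same gold load.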

Seedlings prepared as indicated above were bombarded twice using the PDS-1000/He gun at 1100 psi under 27-28 inches of Hg vacuum. The distance between macrocarrier and stopping screen was between 6 and 8 cm. Plates were incubated in sealed containers for 18-24 h in the dark at 27-28° C. following bombardment.

The bombarded seedlings were assayed for transient GUS expression by immersing the seedlings in 10-15 mL of GUS assay buffer containing 100 mM NaH₂PO₄·H₂O (pH 7.0), 10 mM EDTA, 0.5 mM K₄Fe(CN)₆·3H₂O, 0.1% Triton X-100 and 2 mM 5-bromo-4-chloro-3-indolyl glucuronide. The tissues were incubated in the dark for 24 h at 37° C. Replacing the GUS staining solution with 100% ethanol stopped the assay. GUS expression/staining was visualized under a microscope.

EXAMPLE 4 Expression Pattern of PCO-rtp

Stably transformed plants were created using Agrobacterium protocols (detailed in Example 5) to allow for a more detailed characterization of expression pattern and expression level directed by the promoter. The PsaH promoter (SEQ ID NO:1) was operably connected to the GUS gene (abbreviated as PsaH:GUS) or to the first intron from the Adh1 gene and then the GUS gene (abbreviated as PsaH:Adh1:GUS) so that PsaH promoter activity could be qualitatively detected by histochemically staining tissue for GUS activity or be quantitated using GUS fluorometric assays. The Adh1 intron was included once again for the purpose of determining whether its presence would increase expression when linked to the PsaH promoter.

A total of 19 events were regenerated for the PsaH:GUS and the PsaH:Adh1:GUS plasmids. Plants were grown under greenhouse conditions until they reached a growth stage ranging from V4 to V6. Vegetative growth stages are determined by the number of collared leaves on the plant. Therefore, a plant at V6 stage has 6 fully collared leaves. Leaf and root tissue were sampled from each plant and histochemically assayed for GUS activity. Tassel, silk and pollen samples also were collected and histochemically assayed for GUS activity when the plants reached the reproductive developmental growth stages of R1 and R2. These stages are noted by ear silking and pollen-shed.

Results showed that GUS was expressed in green tissues (Table 1). Both the PsaH:GUS and the PsaH:Adh1:GUS cassettes drove expression in vascular and non-vascular tissues of the leaves. In the tassels, both cassettes directed expression in the glumes, rachis and rachilla. No expression was detected in pollen. Expression also was not detected in silks and roots; this includes both nodal and lateral roots. A quantitative comparison of expression between leaves and roots showed no measurable expression in the root tip and mature region of the root. Expression in leaves differed 2-fold between the cassettes, with the PsaH:Adh1:GUS cassette having the lower expression level.

TABLE 1 Plant Expression Results for the PsaH Promoter

                          V5-V6                                     R1-R2
                          Leaf   Root: Tip   Root: mature region    Tassel   Silk   Pollen
PsaH (SEQ ID NO: 1)       +++    −           −                      +++      −      −
PsaH:ADH (SEQ ID NO: 1)   ++     −           −                      ++       −      −
negative control          −      −           −                      −        −      −

Histochemical Staining of Plant Tissues for GUS Activity

Detection of GUS activity was accomplished by placing tissue from transformed plants into 12-well or 6-well plates containing 2 to 5 mL GUS assay buffer (assay buffer recipe described in Example 3). Plates were placed under house vacuum for 10 min, and then incubated overnight at 37° C. Tissue was cleared of pigmentation with 1 to 3 successive 12 hour incubations in 100% ethanol at room temperature. The tissues were stored in 70% ethanol at 4° C.

EXAMPLE 5 Agrobacterium-Mediated Transformation of Maize and Regeneration of Transgenic Plants

For Agrobacterium-mediated transformation of maize with a promoter sequence of the embodiments, the method of Zhao was employed (See: U.S. Pat. No. 5,981,840, (hereinafter the '840 patent) and PCT patent publication WO98/32326, the contents of both of which are hereby incorporated by reference).

Agrobacterium were grown on a master plate of 800 medium and cultured at 28° C. in the dark for 3 days, and thereafter stored at 4° C. for up to one month. Working plates of Agrobacterium were grown on 810 medium plates and incubated in the dark at 28° C. for one to two days.

Briefly, embryos were dissected from fresh, sterilized corn ears and kept in 561Q medium until all required embryos were collected. Embryos were then contacted with an Agrobacterium suspension prepared from the working plate, in which the Agrobacterium contained a plasmid comprising the promoter sequence of the embodiments. The embryos were co-cultivated with the Agrobacterium on 562P plates, with the embryos placed axis down on the plates, as per the '840 patent protocol.

After one week on 562P medium, the embryos were transferred to 563O medium. The embryos were subcultured on fresh 563O medium at 2 week intervals and incubation was continued under the same conditions. Callus events began to appear after 6 to 8 weeks on selection.

After the calli had reached the appropriate size, the calli were cultured on regeneration (288W) medium and kept in the dark for 2-3 weeks to initiate plant regeneration. Following somatic embryo maturation, well-developed somatic embryos were transferred to medium for germination (272V) and moved to a lighted culture room. Approximately 7-10 days later, developing plantlets were transferred to 272V hormone-free medium in tubes for 7-10 days until plantlets were well established. Plants were then transferred to inserts in flats (equivalent to 2.5″ pot) containing potting soil and grown for 1 week in a growth chamber, subsequently grown an additional 1-2 weeks in the greenhouse, then transferred to classic 600 pots (1.6 gallon) and grown to maturity.

Media Used in Agrobacterium-Mediated Transformation and Regeneration of Transgenic Maize Plants:

561Q medium comprises 4.0 g/L N6 basal salts (SIGMA C-1416), 1.0 mL/L Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/L thiamine HCl, 68.5 g/L sucrose, 36.0 g/L glucose, 1.5 mg/L 2,4-D, and 0.69 g/L L-proline (brought to volume with D-I H₂O following adjustment to pH 5.2 with KOH); 2.0 g/L GELRITE (added after bringing to volume with D-I H₂O); and 8.5 mg/L silver nitrate (added after sterilizing the medium and cooling to room temperature).

800 medium comprises 50.0 mL/L stock solution A and 850 mL D-I H₂O, and brought to volume minus 100 mL/L with D-I H₂O, after which is added 9.0 g of phytagar. After sterilizing and cooling, 50.0 mL/L stock solution B is added, along with 5.0 g of glucose and 2.0 mL of a 50 mg/mL stock solution of spectinomycin. Stock solution A comprises 60.0 g of dibasic K₂HPO₄ and 20.0 g of monobasic sodium phosphate, dissolved in 950 mL of water, adjusted to pH 7.0 with KOH, and brought to 1.0 L volume with D-I H₂O. Stock solution B comprises 20.0 g NH₄Cl, 6.0 g MgSO₄·7H₂O, 3.0 g potassium chloride, 0.2 g CaCl₂, and 0.05 g of FeSO₄·7H₂O, all brought to volume with D-I H₂O, sterilized, and cooled.

810 medium comprises 5.0 g yeast extract (Difco), 10.0 g peptone (Difco), and 5.0 g NaCl, dissolved in D-I H₂O, and brought to volume after adjusting pH to 6.8. 15.0 g of bacto-agar is then added, the solution is sterilized and cooled, and 1.0 mL of a 50 mg/mL stock solution of spectinomycin is added.

562P medium comprises 4.0 g/L N6 basal salts (SIGMA C-1416), 1.0 mL/L Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/L thiamine HCl, 30.0 g/L sucrose, and 2.0 mg/L 2,4-D (brought to volume with D-I H₂O following adjustment to pH 5.8 with KOH); 3.0 g/L GELRITE (added after bringing to volume with D-I H₂O); and 0.85 mg/L silver nitrate and 1.0 mL of a 100 mM stock of acetosyringone (both added after sterilizing the medium and cooling to room temperature).

563O medium comprises 4.0 g/L N6 basal salts (SIGMA C-1416), 1.0 mL/L Eriksson's Vitamin Mix (1000× SIGMA-1511), 0.5 mg/L thiamine HCl, 30.0 g/L sucrose, 1.5 mg/L 2,4-D, 0.69 g L-proline, and 0.5 g MES buffer (brought to volume with D-I H₂O following adjustment to pH 5.8 with KOH). Then, 6.0 g/L ULTRAPURE agar-agar (EM Science) is added and the medium is sterilized and cooled. Subsequently, 0.85 mg/L silver nitrate, 3.0 mL of a 1 mg/mL stock of Bialaphos, and 2.0 mL of a 50 mg/mL stock of carbenicillin are added.

288W medium comprises 4.3 g/L MS salts (GIBCO 11117-074), 5.0 mL/L MS vitamins stock solution (0.100 g/L nicotinic acid, 0.02 g/L thiamine HCl, 0.10 g/L pyridoxine HCl, and 0.40 g/L glycine brought to volume with polished D-I H₂O) (Murashige and Skoog (1962) Physiol. Plant. 15:473), 100 mg/L myo-inositol, 0.5 mg/L zeatin, and 60 g/L sucrose, which is then brought to volume with polished D-I H₂O after adjusting to pH 5.6. Following this, 6.0 g/L of ULTRAPURE agar-agar (EM Science) is added and the medium is sterilized and cooled. Subsequently, 1.0 mL/L of 0.1 mM abscisic acid, 1.0 mg/L indoleacetic acid, and 3.0 mg/L Bialaphos are added, along with 2.0 mL of a 50 mg/mL stock of carbenicillin.

Hormone-free medium (272V) comprises 4.3 g/L MS salts (GIBCO 11117-074), 5.0 mL/L MS vitamins stock solution (0.100 g/L nicotinic acid, 0.02 g/L thiamine HCl, 0.10 g/L pyridoxine HCl, and 0.40 g/L glycine brought to volume with polished D-I H₂O), 0.1 g/L myo-inositol, and 40.0 g/L sucrose (brought to volume with polished D-I H₂O after adjusting pH to 5.6); and 6 g/L Bacto-agar (added after bringing to volume with polished D-I H₂O), sterilized and cooled to 60° C.
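By way of illustration only, the stock-solution additions recited in the media above can be converted to approximate final concentrations. The following Python sketch assumes each medium is prepared to a final volume of 1 L (an assumption wherever a recipe does not state a per-liter basis) and ignores the small volume contributed by the stock itself. The function names are hypothetical and form no part of the described methods.

```python
def final_mg_per_l(stock_mg_per_ml, stock_volume_ml, final_volume_l=1.0):
    """Approximate final concentration (mg/L) after adding a stock solution.

    Assumes the added stock volume is negligible relative to the final volume.
    """
    return stock_mg_per_ml * stock_volume_ml / final_volume_l


def final_micromolar(stock_mm, stock_volume_ml, final_volume_l=1.0):
    """Approximate final concentration (uM) after adding a millimolar stock.

    stock_mm [mmol/L] * volume [mL] gives umol; dividing by litres gives uM.
    """
    return stock_mm * stock_volume_ml / final_volume_l


# Stock additions taken from the recipes above; the 1 L final volume is assumed.
spectinomycin_800 = final_mg_per_l(50.0, 2.0)      # 800 medium
spectinomycin_810 = final_mg_per_l(50.0, 1.0)      # 810 medium
bialaphos_563o = final_mg_per_l(1.0, 3.0)          # 563O medium
carbenicillin_563o = final_mg_per_l(50.0, 2.0)     # 563O medium
acetosyringone_562p = final_micromolar(100.0, 1.0)  # 562P medium

print(f"800 spectinomycin:   {spectinomycin_800:g} mg/L")
print(f"810 spectinomycin:   {spectinomycin_810:g} mg/L")
print(f"563O Bialaphos:      {bialaphos_563o:g} mg/L")
print(f"563O carbenicillin:  {carbenicillin_563o:g} mg/L")
print(f"562P acetosyringone: {acetosyringone_562p:g} uM")
```

Under these assumptions, the Bialaphos level computed for 563O medium matches the 3.0 mg/L stated directly for 288W medium.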

All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.

1. An isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of: a) a nucleotide sequence comprising the sequence set forth in SEQ ID NO:1, or a complement thereof; b) a nucleotide sequence comprising the plant promoter sequences of the plasmids deposited as Patent Deposit No. NRRL-B-50179 or a complement thereof; and c) a nucleotide sequence comprising a fragment of a) or b), wherein said fragment initiates transcription in a plant cell.

2. A DNA construct comprising a nucleotide sequence of claim 1 operably linked to a heterologous nucleotide sequence of interest.

3. A vector comprising the DNA construct of claim 2.

4. A plant cell having stably incorporated into its genome the DNA construct of claim 2.

5. The plant cell of claim 4, wherein said plant cell is from a monocot.

6. The plant cell of claim 5, wherein said monocot is maize.

7. The plant cell of claim 4, wherein said plant cell is from a dicot.

8. A plant having stably incorporated into its genome the DNA construct of claim 2.

9. The plant of claim 8, wherein said plant is a monocot.

10. The plant of claim 9, wherein said monocot is maize.

11. The plant of claim 8, wherein said plant is a dicot.

12. A transgenic seed of the plant of claim 8, wherein said seed comprises the DNA construct.

13. The plant of claim 8, wherein the heterologous nucleotide sequence of interest encodes a gene product that confers herbicide, salt, cold, drought, pathogen, or insect resistance.

14. A method for expressing a nucleotide sequence in a plant, said method comprising introducing into a plant a DNA construct, said DNA construct comprising a promoter and operably linked to said promoter a heterologous nucleotide sequence of interest, wherein said promoter comprises a nucleotide sequence selected from the group consisting of: a) a nucleotide sequence comprising the sequence set forth in SEQ ID NO:1, or a complement thereof; b) a nucleotide sequence comprising the plant promoter sequences of the plasmids deposited as Patent Deposit No. NRRL-B-50179 or a complement thereof; and c) a nucleotide sequence comprising a fragment of a) or b), wherein said fragment initiates transcription in a plant cell.

15. The method of claim 14, wherein said plant is a dicot.

16. The method of claim 14, wherein said plant is a monocot.

17. The method of claim 16, wherein said monocot is maize.

18. The method of claim 14, wherein the heterologous nucleotide sequence encodes a gene product that confers herbicide, salt, cold, drought, pathogen, or insect resistance.

19. The method of claim 14, wherein said heterologous nucleotide sequence of interest is selectively expressed in the green tissues of a plant.

20. A method for expressing a nucleotide sequence in a plant cell, said method comprising introducing into a plant cell a DNA construct comprising a promoter operably linked to a heterologous nucleotide sequence of interest, wherein said promoter comprises a nucleotide sequence selected from the group consisting of: a) a nucleotide sequence comprising the sequence set forth in SEQ ID NO:1, or a complement thereof; b) a nucleotide sequence comprising the plant promoter sequences of the plasmids deposited as Patent Deposit No. NRRL-B-50179 or a complement thereof; and c) a nucleotide sequence comprising a fragment of a) or b), wherein said fragment initiates transcription in a plant cell.

21. The method of claim 20, wherein said plant cell is from a monocot.

22. The method of claim 21, wherein said monocot is maize.

23. The method of claim 20, wherein said plant cell is from a dicot.

24. The method of claim 20, wherein the heterologous nucleotide sequence encodes a gene product that confers herbicide, salt, cold, drought, pathogen, or insect resistance.

25. A method for selectively expressing a nucleotide sequence in a plant's green tissues, said method comprising introducing into a plant cell a DNA construct, and regenerating a transformed plant from said plant cell, said DNA construct comprising a promoter and a heterologous nucleotide sequence operably linked to said promoter, wherein said promoter comprises a nucleotide sequence selected from the group consisting of: a) a nucleotide sequence comprising the sequence set forth in SEQ ID NO:1, or a complement thereof; b) a nucleotide sequence comprising the plant promoter sequences of the plasmids deposited as Patent Deposit No. NRRL-B-50179 or a complement thereof; and c) a nucleotide sequence comprising a fragment of a) or b), wherein said fragment initiates transcription in a plant cell.

26. The method of claim 25, wherein expression of said heterologous nucleotide sequence alters the phenotype of said plant.

27. The method of claim 25, wherein the plant is a monocot.

28. The method of claim 27, wherein the monocot is maize.

29. The method of claim 25, wherein the plant is a dicot.

30. The method of claim 25, wherein the heterologous nucleotide sequence encodes a gene product that confers herbicide, salt, pathogen, or insect resistance.